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ABSTRACT: 

System for simultaneous and selective invalidation of the entries of a data cache (2) cooperating 
with an address cache (1), following an address cache entry invalidation, comprising for each 
data cache entry, one-to-one correlated with an address cache entry by one and the same 
shifted address, associated in the address cache with an address to be shifted, and in the data 
cache with a data line, a field for storage of an address cache entry selection index IDX (1 ...m) so 
that by comparing each of the selection indices IDX (1...m), stored in the data cache, with a state 
vector, output by the address cache and representative of the state of each of the address cache 
entries, executed by comparison logic (16) it is possible selectively to invalidate the data cache 
entries, as soon as the address cache entries with which they are one-to-one correlated are 
invalidated. 
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(54) System for selective and joint invalidation of entries of two, respectively address and data 
caches, caused by one or more address cache selective Invalidations 

(57) System for simultaneous and selective invali- 
dation of the entries of a data cache (2) cooperating with 
an address cache (1), following an address cache entry 
invalidation, comprising for each data cache entry, one- 
to-one correlated with an address cache entry by one 
and the same shifted address, associated in the 
address cache with an address to be shifted, and in the 
data cache with a data line, a field for storage of an 
address cache entry selection index IDX (1...m) so that 
by comparing each of the selection indices IDX (1...m), 
stored in the data cache, with a state vector, output by 
the address cache and representative of the state of 
each of the address cache entries, executed by compar- 
ison logic (16) it is possible selectively to invalidate the 
data cache entries, as soon as the address cache 
entries with which they are one-to-one correlated are 
invalidated. 
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Description 

The present invention relates to a jointly operating 
data and address cache system, which maintains the 
data consistency relationship in the indirect addressing s 
of a paged memory space. 

In modern data processing systems, the memory is 
not directly visible but is organized into segments, gen- 
erally but not necessarily with dimensions of 4 Kbytes 
each, termed pages, and the addressing in effected in 
indirect mode. 

This is essential in order to make the logical loca- 
tion of the information independent, this information 
being gradually retrieved in the course of processes, 
from their actual physical location in a memory space. 

The manner of addressing commonly used by a 
processor or an I/O unit (input output) for accessing a 
memory space, makes provision to employ a logical (or 
virtual) address in which a field of most significant bits, 
for example from 12 to 31 , logically identifies a page of 
information, and the least significant bits, from 0 to 11 , 
commonly termed "offsets", identify an item of informa- 
tion, at byte level, within the page. 

The logical page address LPA is combined (gener- 
ally through a sum operation) with a base or TLBP 
(Translation Logic Block Pointer) pointer stored in a 
TLBR (Translation Logic Block Register) register. 

The shifted address thus obtained is used to point 
to an RPE (Real Page Entry) location inside an address 
table, termed TLB (Translation Logic Block) from which 
the actual page address or RPA (Real Page Address) is 
extracted, to be used to address the memory location 
where the information item is to be read or written. 

Clearly the real page address RPA, concatenated 
with the offset, which undergoes no modification, consti- 
tutes the real address. 

Two conclusions derive immediately from this pre- 
amble: 

1) Indirect addressing, with one or more indirect- 
nesses, is laborious and involves considerable 
losses of time. 

2) Since the real page addresses are contained in 
memory tables, which can be modified, the real 
page addresses have to be regarded as data, to all 
effects. 

To avoid the laborious process of generating real 
addresses, with arithmetic and memory access opera- 
tions, whenever it is desired to read or write a data item, 
use is made, as is known, of address caches, i.e. fast 
associative memories, in which a plurality of pairs is 
stored: logical page address and conesponding real 
page address. 

Since these address caches, also termed MMUs 
(Memory Management Units) have limited capacity, 
their content has to be renewed according to require- 
ments and substituted with LPA/RPA pairs of actual or 



envisaged use or else modified if the LPA/RPA relation 
changes (because the data have been physically trans- 
ferred into the memory space or because the table TLB 
has been transferred or modified). 

The address caches also have therefore to be sub- 
ject to a consistency protocol, albeit simplified, and a 
valid/invalid state indicator has to be associated with the 
various cache entries, this indicator being set to the 
invalid state for all the entries of the cache, if the latter is 
cleared or "flushed" to allow the information content to 
be replaced by other information, or selectively just for 
those entries which contain information which is no 
longer consistent with the table TLB or with the arrange- 
ment of the data in the memory space. 

Selective invalidation of the cache entries, so as to 
make way gradually for new information, with "replace- 
ment" operations can also be effected on the basis of 
the well-known LRU criterion for selecting the least 
recently used entries for replacement thereof, or else 
with a FIFO (First In First Out) criterion. 

The operations for invalidating the entries of an 
address cache, be they global (flushing) or selective, 
have weighty implications if, as generally happens, the 
address cache operates jointly with a data cache since 
the latter, in order to identify the presence or otherwise 
of data within it, employs the real addresses, as trans- 
lated by the address cache, and not the logical 
addresses. 

Evidently if the real addresses are no longer valid, 
there is no longer a one-to-one correlation between 
data and real addresses, and the data associated with 
addresses which are no longer valid have also to be 
invalidated. 

However, precisely because there is no longer a 
one-to-one correlation between data and real 
addresses, no logical instrument is now available which 
makes it possible to identify the data cache entries 
which have to be selectively invalidated. 

To achieve this result it would be necessary to pro- 
vide complex procedures which preface an operation for 
invalidating an entry of an address cache with a selec- 
tive operation for invalidating all the entries of the data 
cache with which is associated a real page address 
which will then be invalidated. 

These procedures would turn out to be so laborious 
that in practice selective invalidation is rejected and all 
the entries of the data cache (or of the data caches if 
more than one) are invalidated with a "flushing" opera- 
tion. 

As a consequence the data caches are latent until 
they are loaded with new information, this involving an 
inevitable slowing down of the processes in progress. 

The present invention solves this problem and pro- 
vides a system of address cache and data cache oper- 
ating jointly, which makes it possible selectively and 
jointly to invalidate the entries of the two caches when 
this is required by the invalidating of one or more entries 
of the address cache. 
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The invalidation operation is practically simultane- 
ous, with merely the propagation delays of a simple 
logic chain and is preferably carried out in two clock sig- 
nal periods of the system. 

These results are achieved by a system as defined 
in the claims 

Essentially the system comprises an address 
cache with several entries, preferably but not necessar- 
ily of the full associative type, which, unlike the known 
address caches, is provided with a first output channel 
for a state vector whose bits each represent the 
valid/invalid state of one of the entries, and with a sec- 
ond output channel for an index, indicating the entry 
selected in rotation. 

Associated with the address cache is a data cache 
in which each entry, as well as the conventional fields 
for data, for the real address associated with the data 
and for attributes (defined by a consistency protocol) 
also comprises an additional field for storing the index 
which identifies in the address cache the entry which 
contains the same real page address. 

This index, concatenated with the real page 
address and available with the latter at the output of the 
address cache, is written to the entry, when the entry is 
refilled, and correlates the entry of the data cache one- 
to-one with that of the entry of the address cache which 
contains the same real page address. 

It is therefore possible by means of this index selec- 
tively to invalidate the entries of the data cache which 
are associated one-to-one with an entry of the address 
cache which is invalidated. 

Logic comparison circuits, one for each entry of the 
data cache, are provided for this purpose, which com- 
pare the index, decoded into an index vector if not 
already stored in this form, with the state vector output 
by the address cache. 

If the index coincides with that of an invalid-state 
entry of the address cache, the entry of the data cache 
which contains this index is then also invalidated. 

Invalidation can take place asynchronously, simul- 
taneously with the invalidation of the entry in the 
address cache, with merely the delay due to the propa- 
gation times of the logic comparison network or, prefer- 
ably, synchronously with a delay equal to one period of 
a clock signal, relative to the invalidation of the entry in 
the address cache. 

The characteristics and advantages of the present 
invention will become clearer from the description which 
follows of a preferred embodiment given with reference 
to the appended drawings in which: 

Fig. 1 is a block diagram of a preferred embodiment 
of a system of caches, one for addresses, the other 
for data, operating jointly and consistently in 
accordance with the present invention; 
Fig. 2 is a construction diagram for comparison 
logic for the selective invalidation of data cache 
entries following upon the invalidation of address 



cache entries in respect of the system of Fig. 1 ; 
- Fig. 3 is a block diagram of a variant embodiment of 
an address cache for the system of Fig. 1 . 

5 With reference to Figure 1 , the system which is the 
subject of the present invention comprises an address 
cache 1 cooperating with a data cache 2. 

The two caches form part of a processing unit, for 
example a system processor or an I/O unit which reads 
10 or writes data in a generic memory space by means of 
instructions whose format is represented by the string 3. 

The instruction comprises a logical address, for 
example 32-bit, consisting of an OFFSET (bits 0-11) 
and of a logical page address LPA (bits 12-31), an oper- 
15 ation code OP which defines the operation as a 
read/write operation or suchlike. In the case of write 
operations the instruction also comprises the data to be 
written DATA. 

The function of the cache 1 is to convert the logical 
20 page address LPA into a real page address, if the latter 
is present in the cache, in the valid state, and to signal 
to the processing unit that the read or write operation is 
to be preceded by the tracing of the real page address 
in a predetermined table if this address is not present in 
25 the cache or is invalid. 

In its general aspects the cache 1 is of conventional 
and well-known type. 

It comprises an associative area 4, a random 
access RAM area 5, addressable with an index IDX and 
30 control logic 6. 

The two areas comprise a plurality of entries from 1 
to n, where for example n = 16. 

Stored in the various entries of the associative area 
are up to n logical page addresses LPA(1...n) and 
35 stored in the corresponding entries of the addressable 
area are the corresponding real page address 
RPA(1...n) each accompanied by a state bit V(1...n) 
which indicates, according to its logic level, whether the 
entry is valid or otherwise, and by other bits indicative of 
40 page attributes, which here are of no interest. 

It should be observed that the logical and real page 
addresses do not necessarily have the same number of 
bits but for example the real addresses can have a 
greater number thereof so as to address a bigger sys- 
45 tern physical space than that which is logically address- 
able. 

In the loading phase of the cache 1, the various 
pairs LPA(1...n) and corresponding RPA(1...n) are pre- 
sented to the cache via a data channel. 

so The allocating of the data to the entries and the set- 
ting of the corresponding state bit V(1...n) is managed 
by the control logic 6. 

For the conversion of logical addresses into the cor- 
responding physical addresses the logical page 

55 address of an instruction is presented via an address 
channel 8 to the associative area of the cache, and 
compared with all the addresses LPA(1...n) stored 
therein. 
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If one of then coincides with the input address, the 
associative area 4 posts a signal HIT to the control unit 
and outputs an index IDX for selecting the correspond- 
ing entry of the addressable area 5, which is read and 
outputs the corresponding address RPA, the associated 
state bit V and the bits representative of other attributes. 

The control logic having verified that the state of the 
entry is valid, and that the required operation is consist- 
ent with the page attributes, enables the outputting by 
the cache 1 of the address RPA. 

Otherwise, if the signal HIT is not posted or if the 
entry is not valid, a procedure is activated, as already 
stated, for retrieving and loading the required pair LPA, 
RPA into an entry of the cache, subject to the invalida- 
tion, if necessary, of a cache entry for the replacement 
thereof. 

The address RPA output by the cache 1 and con- 
catenated with the address offset, constitutes a real 
address, directly useable to access the memory space. 

The cache 1 is provided, for the reasons which will 
be explained later, also with a first output channel 9 for 
the simultaneous reading in parallel of the state bit 
V(1 ...n) of each of the entries. 

Together, these bits constitute a state vector which 
describes the complete state of the cache. 

It is also provided with a second output channel 10 
for the reading of the selection index IDX and the 
employing thereof outside the cache. 

These two channels are the significant aspects 
which distinguish the cache 1 from the conventional 
address caches known in the art. 

The data cache 2 also in its architecture can be 
regarded as of conventional type, except for the aspect 
specified earlier, and comprises an associative area 11 
with m entries (for example m = 8) an area 12 which is 
addressable with a selection index generated by the 
associative area, and control logic 13. 

Stored in the various entries of the associative area 
are up to a maximum of m real addresses RA(1...m), 
excluding the least significant bits depending on the 
magnitude of the data blocks or lines of the cache. 

Stored in the corresponding entries of the address- 
able area are data lines DATA LINE, in a first field and, 
in a second field, the state bits which define the state of 
the data, i.e. of the entry, in conformity with a consist- 
ency protocol, for example the protocol known by the 
acronym MES! standing for Modified, Exclusive, 
Shared, Invalid. 

For simplicity the state bits are represented in Fig- 
ure 1 as a single bit V(1 ...m) which, according to its logic 
level, defines the associated data as valid or invalid. 

Since the cache 2 does not differ through these 
aspects from the data caches known in the art, any 
detailed description of its structure and mode of opera- 
tion is superfluous. 

The significant aspect which distinguishes the 
cache 2 from those known in the art is that each cache 
entry comprises an index field 15 for storing an index 



iDX(1...m) received from the address cache via the 
channel 10. 

This index is written to the entry concurrently with 
the writing of the real address RA(1...m). 
5 Another significant aspect is that the cache 2 is pro- 
vided with an output channel 14 for the simultaneous 
reading in parallel of the index field 15 of all the entries. 

The system is completed by comparison logic 16, 
consisting of a number m of comparators equal to the 
10 number of entries of the data cache, which receives the 
indices IDX(1...m) from the various entries, compares 
them with the state vector of the address cache 1, 
received via the channel 9, and outputs, on a channel 
17 with m wires, selective-invalidation signals. 
15 The structure of the comparison logic is very simple 
and by way of illustration one of the various possible 
embodiments thereof is given in Figure 2. 

It is assumed for example that the invalid state of 
the entries of the address cache is represented by a bit 
20 V(i) at logic level 1. 

It is also assumed that the addresses IDX output by 
the data cache are represented by a binary code IDXi. 

In this case the comparison logic, for each code 
IDXi received, (the code IDX(1) in the specific case rep- 
25 resented in Figure 2) consists of a decoder 18 which 
converts the code into a bit vector IDX(1,1..n) in which 
just one of the bits is at logic level 1. 

With each bit of the vector there corresponds an 
entry of the address cache and the bit at logic level 1 
so identifies the entry selected by the code IDXI . 

The bits of the vector IDX(1,1..n) are each logic 
ANDed with the corresponding bits V1,V2,..Vn of the 
state vector, received via the channel 9, by means of n 
AND gates 19, 20, 21. 
35 The outputs of the AND gates 19, 20, 21 are logic 
ORed by the OR gate 22. 

It is evident that if the bit at logic level 1 of the vector 
IDX (1,1...n) is paired with a bit of the state vector at 
logic level 1, hence indicative of an invalid entry of the 
40 address cache, a signal KILL V(1) at logic level 1 is 
present at the output of the OR gate 22, and makes it 
possible to invalidate the entry of the data cache asso- 
ciated with the code IDX1 . 

Wholly identical is the comparison logic for the 
45 other codes IDX2, ... IDXm which generates the invalida- 
tion signals KILL V(2),..KILL V(m). 

The signal KILL V(1) can be used as the signal for 
asynchronous setting of the memory cell 23 which 
defines the invalid/valid state (depending on whether 
so set or reset) of the first entry of the data cache which 
contains the code IDX1 . 

In practice, the cache celts preferably consist of D- 
type flip-flops, hence the signal KILL V(1) is applied to 
the D input (logic ORed with other invalidation signals 
55 consequent on a consistency protocol) and the cell 
switched to the set state on command from a periodic 
clock signal CLK. 

It is therefore evident that if an entry of the address 
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cache becomes invalidated, and this occurs synchro- 
nously with a first pulse of the periodic clock signal, then 
all and only the entries of the data cache which are no 
longer consistent with the content of the address cache 
become selectively invalidated with the immediately fol- 
lowing pulse. 

Since the comparison logic 16 is very simple it is 
possible to operate with a periodic clock signal having 
high frequency of the order of 100 MHz or more. 

It should be noted that the index IDX generated by 
the address cache can already be in vectorized form or 
can be converted into vectorized form before being 
stored in the data cache. 

In this case the decoder 18 in the comparison logic 
is superfluous. 

It should also be noted that in the address and data 
cache system described, the mechanism of selective 
invalidation of entries of the data cache, as a conse- 
quence of invalidations executed in the address cache, 
which may be dubbed indirect invalidation, is additional 
to the mechanisms of direct invalidation, which are nor- 
mally provided, to allow the replacement of data, when 
necessary, and to ensure their consistency. 

The above description relates only to a preferred 
embodiment and it is clear that many variants may be 
introduced, both to the comparison logic 16 and to the 
structure of the cache. 

For example, rather than using caches of the "fully 
associative" type it is also possible to use caches of the 
so-called "set-associative" type with one or more path- 
ways, for the address or data cache or for both, if the 
limits which these types of caches put on the replace- 
ment operations and on the classes of storable informa- 
tion do not constitute an impediment. 

Fig. 3 represents a cache of this type, used as 
address cache for the system already described. 

The cache comprises a random access RAM mem- 
ory 24 with a number n of entries, comparison logic 25 
and control logic 26. 

Each entry comprises a field for storing a logical 
page address LPA(1)...LPA(n), afield for storing the cor- 
responding real page address RPA(1)...RPA(n) and a 
field for storing a state bit V(1)...V(n). ^ 

The memory 24 is addressed with an entry selec- 
tion index IDX consisting of a certain number of bits (for 
example bits 12-14) of the logical page address LPA, 
received from outside. 

The outside logical page address is compared with 
that stored in the entry selected by the index, and if 
there is coincidence, and the entry is valid, the corre- 
sponding real page address read from memory is the 
translation of the logical input address. 

In this case the index code IDX is supplied from out- 
side rather than being generated by the cache. 

The same code, in the case when converted into a 
vector IDX VECTOR by a decoder 27 can be applied as 
input to a data cache, not illustrated, as in the embodi- 
ment of Fig. 1. 
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In this case also the address cache has to be pro- 
vided with output channels 28 for the reading in parallel 
of the state vector S VECTOR. 

In the above description the address to be shifted 
5 was termed the logical address for simplicity and clarity 
and so as to distinguish it clearly from the real or shifted 
physical address. 

However, it is evident that in modern data process- 
ing systems, in particular in multiprocessor systems, it is 
10 possible to have several successive levels of shifting of 
addresses, so that logical address should be under- 
stood to mean any address which is to be shifted, by 
one or more processing units so as to address the phys- 
ical memory space. 

15 

Claims 

1. System of data cache (2) and address cache (1), 
operating jointly, in which said address cache com- 

20 prises a plurality of entries, selected by an index 
(IDX), each containing a page address to be trans- 
lated, a corresponding translated page address, 
and a bit indicating the valid/invalid state of the 
respective entry, and said data cache corrprises a 

25 plurality of entries each containing a data line, a 
respective address comprising a translated page 
address, and a respective field of state attributes 
defined by a consistency protocol, one of said 
states being an invalid state, characterized in that 

30 

said address cache comprises a first output 
channel (9) for the reading in parallel, as state 
vector, of said bit indicating the valid/invalid 
state of each of said address cache entries and 
35 a second output channel (10) for said index 

(IDX), 

said data cache (2) comprises, in each of its 
entries, an index storage field, received from 
said address cache and associated with said 
40 translated page address contained in the same 

entry, and an output channel (14) for the read- 
ing in parallel of said index, stored in each of 
said data cache entries, 

said system furthermore comprising: 

45 

logic means of comparison, of each of the 
indices output by said data cache on said 
output channel (14) with said state vector, 
received via said first output channel (9) so 

so as to recognize among said indices, 

invalid-state address cache entry selection 
indices and so as to output invalidation sig- 
nals (KILL V(1)...V(m)), for the data cache 
entries containing the indices recognized 

55 as invalid-state address cache entry selec- 

tion indices. 

2. System according to Claim 1 in which said index is 
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stored in said data cache (2) in binary vector form. 
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